Abstract

The necessary and sufficient conditions of optimality for the decoding of
quantum signals, which minimize the Bayesian risk, are generalized to the
Shannon mutual information criterion. It is shown that for a linear channel
with Gaussian boson noise these conditions are satisfied by coherent
quasi-measurement of the canonical annihilation amplitudes in the received
superposition.



1 Necessary and sufficient conditions of optimality

In [?], which dealt with optimizing the reception of quantum signals such as
electromagnetic waves in the optical band, our main concern was the search for
necessary conditions of optimality in the class of randomized strategies based
on indirect measurements. Following this universal approach, we specify the
randomized strategies by operator probability measures $\Pi(d\beta)$
corresponding to quasi-measurements of certain noncommuting observables
$b_j = \int \beta_j\,\Pi(d\beta)$, such that

$$\Pi(d\beta) \ge 0, \qquad \int \Pi(d\beta) = 1.$$

*Originally published in: Radio Eng. Electron. Phys., 19 (7), p. 39, 1974
[transl. from Radiotekhnika i Electronika, 1974, 19, 7, 1391-1395].

The equations derived in [?] have the same form for both the risk and the
information criteria of optimality:

$$(R(\beta) - \Lambda)\,\Pi(d\beta) = 0, \qquad \Lambda = \int R(\beta)\,\Pi(d\beta), \tag{1.1}$$



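where $R(\beta) = \int c(\vartheta,\beta)\,\rho(\vartheta)\,P(d\vartheta)$ is the
"a posteriori" risk, or nega-information, operator in the Hilbert space
$\mathcal{H}$ of quantum states. Here $\rho(\vartheta)$ is the family of density
operators describing the state of the quantum channel, which depends on the
transmitted information $\vartheta$ with prior distribution $P(d\vartheta)$, and
$c(\vartheta,\beta)$ is a given penalty function in the Bayes case and the
random information

$$i(\vartheta,\beta) = \ln\frac{p(\beta|\vartheta)}{\int p(\beta|\vartheta')\,P(d\vartheta')}, \qquad p(\beta|\vartheta)\,d\beta = \mathrm{Tr}\,\Pi(d\beta)\,\rho(\vartheta) \tag{1.2}$$

with the opposite sign, $c(\vartheta,\beta) = -i(\vartheta,\beta)$, for the
optimization criterion of maximum Shannon information

$$I_{\beta,\vartheta} = \iint i(\vartheta,\beta)\,p(\beta|\vartheta)\,d\beta\,P(d\vartheta).$$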

It is obvious that the operators $\Pi^\circ(d\beta)$ satisfying equation (1.1)
are degenerate (provided $R(\beta) - \Lambda \ne 0$) and that for each $\beta$
their range belongs to the zero eigensubspace of the difference
$R(\beta) - \Lambda$. If the operators $B(\beta) = R(\beta) - \Lambda$ have, for
each $\beta$, a unique eigenvector $\varphi^\circ_\beta$ corresponding to the
zero eigenvalue, then the operator measure $\Pi^\circ(d\beta)$ is proportional
to the projection operators:
$\Pi^\circ(d\beta) = \varphi^\circ_\beta \varphi^{\circ *}_\beta\,d\beta$. In the
general case, when the zero eigenvalue of the operators $B(\beta)$ may be
degenerate, $B(\beta)\varphi^\circ_{\beta,\nu} = 0$, $\nu \in N(\beta)$, each
resolution of identity $\int \Pi(d\beta) = 1$ satisfying equation (1.1) may be
included in some more detailed resolution
$\int \varphi_\gamma \varphi^*_\gamma\,d\gamma = 1$ with $\gamma = (\beta,\nu)$.
For different $\gamma$ the vectors $\varphi_\gamma$ describing the "elementary"
measurements need not be orthogonal:
$\varphi^*_\gamma \varphi_{\gamma'} \ne 0$.

In the Bayes case the sufficient conditions for optimality are very simple: the
operators $\Pi^\circ(d\beta)$ satisfying equation (1.1) minimize the average
risk

$$R = \langle c(\vartheta,\beta)\rangle = \mathrm{Tr}\int R(\beta)\,\Pi(d\beta) \tag{1.3}$$

if and only if the condition of nonnegative definiteness

$$B(\beta) = R(\beta) - \int R(\beta')\,\Pi^\circ(d\beta') \ge 0 \tag{1.4}$$

is satisfied for all $\beta$. Indeed, for any other operator measure
$\Pi(d\beta) \ne \Pi^\circ(d\beta)$ the difference

$$R - R^\circ = \mathrm{Tr}\left[\int R(\beta)\,\Pi(d\beta) - \int R(\beta)\,\Pi^\circ(d\beta)\right] = \mathrm{Tr}\int B(\beta)\,\Pi(d\beta)$$

(the last equality uses $\int \Pi(d\beta) = 1$) is nonnegative, since it is the
trace of a sum of products of nonnegative operators $B(\beta)\,\Pi(d\beta)$.
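As a finite-dimensional illustration of conditions (1.1) and (1.4), the
following sketch checks them numerically for the discrimination of two
nonorthogonal pure qubit states with the 0-1 penalty. The two-state setup, the
discrete outcome set (sums replace the integrals), the projective measurement
used as the candidate optimum, and all function and variable names are
assumptions made for this example; they are not taken from the text.

```python
import numpy as np

def check_bayes_conditions(angle=0.4, p0=0.3):
    """Check (1.1) and (1.4) for two pure qubit states (illustrative setup)."""
    p1 = 1.0 - p0
    psi0 = np.array([1.0, 0.0])
    psi1 = np.array([np.cos(angle), np.sin(angle)])
    rho0 = np.outer(psi0, psi0)
    rho1 = np.outer(psi1, psi1)

    # Risk operators R(beta) = sum_theta c(theta, beta) rho(theta) P(theta)
    # for the 0-1 penalty: deciding beta=0 costs only when theta=1, and vice versa.
    R = [p1 * rho1, p0 * rho0]

    # Candidate measurement: decide beta=1 on the positive eigenspace of R(0) - R(1).
    w, v = np.linalg.eigh(R[0] - R[1])
    P1 = np.zeros((2, 2))
    for k in range(2):
        if w[k] > 0:
            P1 += np.outer(v[:, k], v[:, k])
    Pi = [np.eye(2) - P1, P1]

    Lam = R[0] @ Pi[0] + R[1] @ Pi[1]       # Lambda = sum_beta R(beta) Pi(beta)
    B = [R[b] - Lam for b in range(2)]      # B(beta) = R(beta) - Lambda

    # Residual of the stationarity condition (1.1): B(beta) Pi(beta) = 0.
    stationarity = max(np.linalg.norm(B[b] @ Pi[b]) for b in range(2))
    # Smallest eigenvalue of (the symmetric part of) B(beta), condition (1.4).
    min_eig = min(np.linalg.eigvalsh((B[b] + B[b].T) / 2).min() for b in range(2))
    risk = np.trace(Lam)                    # average risk at the candidate optimum
    return stationarity, min_eig, risk

if __name__ == "__main__":
    s, m, r = check_bayes_conditions()
    print(f"residual of (1.1): {s:.2e}")
    print(f"smallest eigenvalue of B: {m:.2e}")
    print(f"average risk Tr(Lambda): {r:.6f}")
```

In this setup the returned risk can be compared with the known two-state
minimum error probability, $\tfrac12\bigl(1 - \sqrt{1 - 4 p_0 p_1 \cos^2(\text{angle})}\bigr)$.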

Conditions (1.1) and (1.4) are also applicable to optimizing the processing of
quantum signals according to the maximum likelihood criterion. It suffices to
note that this criterion can be treated formally as a Bayes criterion with a
uniform (unnormalized) a priori distribution $P(d\vartheta) = d\vartheta$ and
the simple penalty function $c(\vartheta,\beta) = -\delta(\vartheta - \beta)$.
This means that the a posteriori risk operator $R(\beta)$ should in this case
be replaced by the density operator $\rho(\vartheta)$, taken with the opposite
sign, at the estimate point $\vartheta = \beta$. We shall call the quantum
strategies $\Pi^\circ(d\beta)$ satisfying conditions (1.1) and (1.4) for
$R(\beta) = -\rho(\beta)$ optimum with respect to the maximum likelihood
criterion. We shall give the solution of the problem of the discrimination of
nonorthogonal signals for the following simplest case.

In [?] the concept of coherent processing of a boson* signal $b = (b_\nu)$ was
introduced, i.e., of an indirect linear measurement realized by measuring the
superposition $b + a_0^*$, where $a_0$ is vacuum boson noise. The question of
the physical realization of this measurement was discussed at the Third
All-Union Conference on the Physical Principles of Information Transmission by
Laser Radiation. In particular, it was shown that coherent measurement of a
narrowband optical signal can be realized by ideal heterodyne reception (ideal
counting of photons at different points of the superposition of the received
and reference waves). The backward vacuum wave radiated by an ideally matched
receiver into the communication line plays the role of the noise $a_0$.

In [?] the quality of such processing was also determined according to the
maximum likelihood criterion for the case where $b$ is the superposition of a
coherent signal $\vartheta = (\vartheta_\nu)$ and a Gaussian boson noise $a$.
Using equations (1.1) and (1.4) together with a suitable representation of the
density operator makes it obvious that the processing described by the
coherent projectors

$$\Pi(d\beta) = |\beta)(\beta|\,d\mu(\beta), \qquad d\mu(\beta) = \prod_\nu \frac{1}{\pi}\,d\mathrm{Re}\,\beta_\nu\,d\mathrm{Im}\,\beta_\nu \tag{1.5}$$

is optimal. A suitable representation of the density operator
$\rho(\vartheta)$ of the displaced Gaussian state $b = \vartheta + a$ is the
expression

$$\rho(\vartheta) = |L|^{-1} : \exp\{-(b-\vartheta)^\dagger L^{-1}(b-\vartheta)\} : \tag{1.6}$$

normally ordered with respect to the operators $b^*$, $b$. Here
$L = \|\langle a_\nu a^*_{\nu'}\rangle\|$ is the correlation matrix of the noise
$a$, the colon brackets $: \cdot :$ denote normal ordering, in which the
operators $a^*$ act to the left, after the operators $a$, and $|L| = \det L$.

*We recall that we give the name boson signal to a quantum signal described by
operators $\{a_\nu, a^*_\nu\}$ satisfying the commutation relations
$a_\nu a_{\nu'} - a_{\nu'} a_\nu = 0$,
$a_\nu a^*_{\nu'} - a^*_{\nu'} a_\nu = \delta_{\nu\nu'}$. In particular, an
optical signal is described by the photon annihilation and creation operators
$a$ and $a^*$ respectively.

Putting $R(\beta) = -\rho(\beta)$, $\Lambda = -|L|^{-1}$ and using the
well-known [?] properties

$$\int |\beta)(\beta|\,d\mu(\beta) = 1, \qquad : p(b^*, b) : |\beta) = p(b^*, \beta)\,|\beta)$$

of the coherent vectors, we find at once that equation (1.1) has a unique
solution coinciding with (1.5). The nonnegative definiteness of the operator

$$B(\beta) = |L|^{-1}\left(1 - : \exp\{-(b-\beta)^\dagger L^{-1}(b-\beta)\} :\right)$$

is beyond doubt.
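For a single mode the nonnegativity of $B(\beta)$ can be checked directly in
the occupation-number basis. The following is a minimal sketch, assuming one
mode with scalar $L \ge 1$, the point $\beta = 0$ (other values of $\beta$
differ only by a unitary displacement), and the standard identity
$:\exp(-x\,a^*a): = (1-x)^{a^*a}$. The function names and the truncation size
are illustrative and do not come from the text.

```python
import numpy as np

def b_operator_spectrum(L, n_max=50):
    """Eigenvalues of B(0) = (1/L) (1 - :exp(-a* a / L):) in the number basis.

    Uses :exp(-x a* a): = (1 - x)^(a* a), so the n-th eigenvalue is
    (1/L) * (1 - (1 - 1/L)**n).  Assumes a single mode with scalar L >= 1.
    """
    n = np.arange(n_max)
    return (1.0 - (1.0 - 1.0 / L) ** n) / L

def density_spectrum(L, n_max=50):
    """Eigenvalues of rho(0) = (1/L) :exp(-a* a / L): in the number basis."""
    n = np.arange(n_max)
    return (1.0 - 1.0 / L) ** n / L

if __name__ == "__main__":
    spec = b_operator_spectrum(2.0)
    print("smallest eigenvalue of B(0):", spec.min())
    print("trace of rho(0), truncated:", density_spectrum(2.0, 200).sum())
```

The only vanishing eigenvalue sits at $n = 0$, i.e., on the coherent vector
$|0)$, in line with the uniqueness of the solution (1.5).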

If the signals are known apart from the phase, the coherent processing
is no longer optimum; however, it remains quasioptimal if the dimensionality of
$\vartheta$ is large.



2 Local optimality according to information criterion

In the case of the information criterion it is difficult to obtain a global
criterion of optimality for the solutions $\Pi^\circ(d\beta)$ of equation (1.1)
minimizing the "Shannon" risk $R = -I_{\beta,\vartheta}$, because of its
nonlinear dependence on $\Pi(d\beta)$ [the "penalty function" (1.2) also
depends on $\Pi(d\beta)$]. We therefore give a differential criterion of
optimality.

We restrict the discussion to operator probabilities of the degenerate form
$\Pi(d\beta) = \varphi_\beta\varphi^*_\beta\,d\beta$, where $(\varphi_\beta)$ is
a complete family of vectors, $\int \varphi_\beta\varphi^*_\beta\,d\beta = 1$.
It can be shown that this is sufficient for verifying the local optimality
$\delta^2 R \ge 0$ of the degenerate solutions
$\Pi^\circ(d\beta) = \varphi^\circ_\beta\varphi^{\circ *}_\beta\,d\beta$ of
equation (1.1).

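Using the constraint on the variations
$\delta\varphi_\beta = \varphi_\beta - \varphi^\circ_\beta$,

$$\int \left(\varphi^\circ_\beta\,\delta\varphi^*_\beta + \delta\varphi_\beta\,\varphi^{\circ *}_\beta + \delta\varphi_\beta\,\delta\varphi^*_\beta\right) d\beta = 0,$$

it is not difficult to find the increment $\Delta R = R - R^\circ$ of the
Shannon risk (1.3) (with the penalty function (1.2) depending on
$\varphi_\beta$) at the stationary point $\varphi^\circ_\beta$ to within
second-order terms in $\delta\varphi_\beta$:

$$\Delta R \approx \int\left\{\int c(\vartheta,\beta)\,\delta p(\beta|\vartheta)\,P(d\vartheta) - \frac12\int\left[(\delta\ln p(\beta|\vartheta))^2 - (\delta\ln p(\beta))^2\right] p(\beta|\vartheta)\,P(d\vartheta)\right\} d\beta$$

$$= \int\left\{\delta\varphi^*_\beta B(\beta)\,\delta\varphi_\beta - \frac12\left[\int\left(\varphi_\beta(\vartheta)^*\delta\varphi_\beta + \delta\varphi^*_\beta\varphi_\beta(\vartheta)\right)^2 p(\beta|\vartheta)\,P(d\vartheta) - \left(\bar\varphi^*_\beta\delta\varphi_\beta + \delta\varphi^*_\beta\bar\varphi_\beta\right)^2 p(\beta)\right]\right\} d\beta, \tag{2.1}$$

where $B(\beta)$ is the difference (1.4),
$\varphi_\beta(\vartheta) = \rho(\vartheta)\varphi^\circ_\beta / p(\beta|\vartheta)$,
and the vector $\bar\varphi_\beta = \rho\,\varphi^\circ_\beta / p(\beta)$, with
$\rho = \int \rho(\vartheta)\,P(d\vartheta)$ and
$p(\beta) = \int p(\beta|\vartheta)\,P(d\vartheta)$, is the vector
$\varphi_\beta(\vartheta)$ averaged with the Bayesian posterior density

$$p(\vartheta|\beta) = p(\beta|\vartheta)\,p(\vartheta) \Big/ \int p(\beta|\vartheta)\,P(d\vartheta).$$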
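The term of (2.1) containing $B(\beta)$ can be traced as follows. This is a
sketch of the calculation under the assumptions that
$p(\beta|\vartheta) = \varphi^*_\beta\rho(\vartheta)\varphi_\beta$ and that
$\Lambda$ is Hermitian; it uses only equation (1.1) in the form
$R(\beta)\varphi^\circ_\beta = \Lambda\varphi^\circ_\beta$ and the normalization
constraint on the variations $\delta\varphi_\beta$.

```latex
\begin{align*}
\delta p(\beta|\vartheta)
  &= \delta\varphi_\beta^*\rho(\vartheta)\varphi_\beta^\circ
   + \varphi_\beta^{\circ *}\rho(\vartheta)\delta\varphi_\beta
   + \delta\varphi_\beta^*\rho(\vartheta)\delta\varphi_\beta, \\
\iint c(\vartheta,\beta)\,\delta p(\beta|\vartheta)\,P(d\vartheta)\,d\beta
  &= \int\big(\delta\varphi_\beta^* R(\beta)\varphi_\beta^\circ
   + \varphi_\beta^{\circ *}R(\beta)\delta\varphi_\beta
   + \delta\varphi_\beta^* R(\beta)\delta\varphi_\beta\big)\,d\beta \\
  &= \mathrm{Tr}\,\Lambda\int\big(\varphi_\beta^\circ\delta\varphi_\beta^*
   + \delta\varphi_\beta\varphi_\beta^{\circ *}\big)\,d\beta
   + \int\delta\varphi_\beta^* R(\beta)\delta\varphi_\beta\,d\beta \\
  &= \int\delta\varphi_\beta^*\big(R(\beta)-\Lambda\big)\delta\varphi_\beta\,d\beta
   = \int\delta\varphi_\beta^* B(\beta)\,\delta\varphi_\beta\,d\beta .
\end{align*}
```

The third line replaces $R(\beta)\varphi^\circ_\beta$ by
$\Lambda\varphi^\circ_\beta$, and the fourth uses the constraint to turn the
terms linear in $\delta\varphi_\beta$ into
$-\int\delta\varphi^*_\beta\Lambda\,\delta\varphi_\beta\,d\beta$.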






A simple analysis of the positivity $\delta^2 R \ge 0$ of the variation (2.1)
of the Shannon risk shows that, in contrast to the Bayes case, the
nonnegativity of the operators, $B(\beta) \ge 0$, is necessary but not
sufficient for the local optimality of the solutions $\varphi^\circ_\beta$ of
the equation $B(\beta)\varphi_\beta = 0$: the additional term in (2.1) (in
square brackets) has the meaning of the a posteriori variance of the real
random quantity $2\mathrm{Re}\,\delta\varphi^*_\beta\varphi_\beta(\vartheta)$ and
is in general positive.

Let, for example, the density operator have the form (1.6) and the prior
distribution $P(d\vartheta)$ be Gaussian in the multidimensional space of the
information parameters $\vartheta = (\vartheta_\nu)$:

$$P(d\vartheta) = |S|^{-1}\exp\{-\vartheta^\dagger S^{-1}\vartheta\}\,d\mu(\vartheta), \qquad d\mu(\vartheta) = \prod_\nu \frac{1}{\pi}\,d\mathrm{Re}\,\vartheta_\nu\,d\mathrm{Im}\,\vartheta_\nu.$$

We shall check the local optimality of the coherent solutions (1.5) of
equation (1.1) in the boson Gaussian case according to the information
criterion. As shown in [?], the coherent vectors (1.5) satisfy equation (1.1),
and the operator $B(\beta)$ has a quadratic Gaussian form:

$$B(\beta) = (b-\beta)^\dagger H\rho\,(b-\beta), \qquad \rho = |L+S|^{-1} : e^{-b^\dagger(L+S)^{-1}b} : ,$$

where the matrix $H = L^{-1} - (S+L)^{-1}$ is not larger than unity, in
accordance with the inequalities $S \ge 0$, $L \ge 1$, $0 \le H \le 1$.
Considering the analytic dependence of the function $\varphi_\beta(\vartheta)$
on $\vartheta$ (i.e., its independence of $\vartheta^*$) and carrying out the
conditional averaging over $\vartheta$ in (2.1) with the density

$$p(\vartheta|\beta) = |M|\exp\{-(\vartheta - A\beta)^\dagger M(\vartheta - A\beta)\},$$

where $A = S(S+L)^{-1}$, $M = S^{-1} + L^{-1}$, we find that in the Gaussian
case the variation $\delta^2 R$ has the form
variation S 2 R has the form 

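$$\delta^2 R = \int \delta\varphi^*_\beta\,\bigl(B(\beta) - D(\beta)\bigr)\,\delta\varphi_\beta\,d\mu(\beta), \qquad d\mu(\beta) = \prod_\nu \frac{1}{\pi}\,d\mathrm{Re}\,\beta_\nu\,d\mathrm{Im}\,\beta_\nu,$$

where

$$D(\beta) = : p(b)\,\frac{e^{-(b-\beta)^\dagger(1-H)(b-\beta)} - e^{-(b-\beta)^\dagger(b-\beta)}}{e^{-(b-\beta)^\dagger(S+L)^{-1}(b-\beta)}} : \;\ge 0, \qquad p(b) = |S+L|^{-1}\exp\{-b^\dagger(S+L)^{-1}b\}.$$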
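The matrix relations of the Gaussian case lend themselves to a quick numerical
spot check. The sketch below draws random Hermitian matrices with $S > 0$ and
$L \ge 1$ (the random construction, the dimension, and the helper names are
assumptions of this example) and verifies that $0 \le H \le 1$ and that
$A = S(S+L)^{-1}$ coincides with $M^{-1}L^{-1}$, the mean map obtained by
completing the square in $\vartheta$ for a Gaussian prior with covariance $S$
and a Gaussian likelihood with covariance $L$.

```python
import numpy as np

def random_psd(dim, rng):
    """Random Hermitian positive semidefinite matrix (illustrative helper)."""
    x = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    return x @ x.conj().T

def gaussian_matrices(dim=3, seed=0):
    """Build S, L and the derived matrices H, A, M used in the Gaussian case."""
    rng = np.random.default_rng(seed)
    eye = np.eye(dim)
    S = random_psd(dim, rng) + 0.1 * eye      # prior covariance, S > 0
    L = eye + random_psd(dim, rng)            # noise correlation matrix, L >= 1
    K = np.linalg.inv(S + L)
    H = np.linalg.inv(L) - K                  # H = L^{-1} - (S + L)^{-1}
    A = S @ K                                 # A = S (S + L)^{-1}
    M = np.linalg.inv(S) + np.linalg.inv(L)   # M = S^{-1} + L^{-1}
    return S, L, H, A, M

def check(dim=3, seed=0, tol=1e-9):
    """Return (0 <= H <= 1 holds, A equals M^{-1} L^{-1})."""
    S, L, H, A, M = gaussian_matrices(dim, seed)
    h = np.linalg.eigvalsh((H + H.conj().T) / 2)
    h_in_unit_interval = bool(h.min() > -tol and h.max() < 1 + tol)
    mean_map_matches = bool(
        np.allclose(np.linalg.inv(M) @ np.linalg.inv(L), A, atol=tol)
    )
    return h_in_unit_interval, mean_map_matches

if __name__ == "__main__":
    print(check())
```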
Thus, in order to prove the optimality of coherent quasi-measurement for the
information criterion one should verify the operator inequality
$B(\beta) - D(\beta) \ge 0$, or

$$p(\beta)\,e^{-b^\dagger(S+L)^{-1}\beta} : \left[\,b^\dagger H e^{-b^\dagger(S+L)^{-1}b}\,b - \left(e^{-b^\dagger(1-H)b} - e^{-b^\dagger b}\right)\right] : e^{-\beta^\dagger(S+L)^{-1}b} \ge 0 \tag{2.2}$$

(here the change of variables $b - \beta \to b$ has been carried out). The
operator on the left-hand side of (2.2) has the structure
$A^*_\beta : [\,\cdot\,] : A_\beta$ and is positive only if the operator in the
square brackets is positive. Considering that the operator inequality

$$: e^{-b^\dagger(S+L)^{-1}b} : \;\ge\; : e^{-b^\dagger(1-H)b} :$$

is satisfied by virtue of the matrix inequality
$(S+L)^{-1} = L^{-1} - H \le 1 - H$, we find that inequality (2.2) is satisfied
if

$$: b^\dagger H e^{-b^\dagger(1-H)b}\,b : \;\ge\; : e^{-b^\dagger(1-H)b} - e^{-b^\dagger b} : . \tag{2.3}$$

Inequality (2.3) becomes obvious in the diagonal representation in the
occupation numbers $n_\nu$:

$$\Bigl(\sum_\nu n_\nu\Bigr)\prod_\nu h_\nu^{n_\nu} \;\ge\; \prod_\nu h_\nu^{n_\nu} - \prod_\nu \delta_{n_\nu 0},$$

where $h_\nu$ are the eigenvalues of the matrix $H = L^{-1} - (S+L)^{-1}$. For
$n_\nu = 0$ both the left- and right-hand sides of inequality (2.3) vanish.
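For a single mode, inequality (2.3) can also be checked with truncated
matrices. The following is a minimal sketch, assuming one mode with a scalar
$0 \le h \le 1$ in place of $H$, the standard identities
$:e^{-x a^*a}: = (1-x)^{a^*a}$ and $:e^{-a^*a}: = |0)(0|$, and a finite Fock
cutoff; the function name and the cutoff are illustrative.

```python
import numpy as np

def inequality_2_3_gap(h, n_max=30):
    """Diagonal of (LHS - RHS) of (2.3) for one mode with scalar h in [0, 1]."""
    n = np.arange(n_max)
    a = np.diag(np.sqrt(n[1:]), k=1)   # annihilation operator: a|n> = sqrt(n)|n-1>
    f = np.diag(float(h) ** n)         # :exp(-(1-h) a* a): = h^(a* a)
    vac = np.zeros((n_max, n_max))
    vac[0, 0] = 1.0                    # :exp(-a* a): is the vacuum projector
    lhs = h * (a.T @ f @ a)            # :a* h exp(-(1-h) a* a) a:
    rhs = f - vac
    gap = lhs - rhs
    # Both sides are diagonal in the number basis, so only the diagonal matters.
    assert np.allclose(gap, np.diag(np.diag(gap)))
    return np.diag(gap)

if __name__ == "__main__":
    gap = inequality_2_3_gap(0.5)
    print("smallest entry of LHS - RHS:", gap.min())
```

In this scalar case the $n$-th entry of the gap is $(n-1)h^n + \delta_{n0}$,
which vanishes for $n = 0$ and $n = 1$ and is nonnegative otherwise.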